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Abstract 

Background: Polymerase Chain Reaction (PCR) and Restriction Fragment Lengtli Polymorpliism of PCR products 
(PCR-RFLP) are extensively used molecular biology techniques. An exercise for the design and simulation of PCR 
and PCR-RFLP experiments will be a useful educational tool. 

Findings: An online PCR and PCR-RFLP exercise has been create that requires users to find the target genes, 
compare them, design primers, search for restriction endonucleases, and finally to simulate the experiment. Each 
user of the service is randomly assigned a gene from Escherichia coli; to complete the exercise, users must design 
an experiment capable of distinguishing among E. coli strains. By applying the experimental procedure to all 
completely sequenced E coli, a basic understanding of strain comparison and clustering can also be acquired. 
Comparison of results obtained in different experiments is also very instructive. 

Conclusions: The exercise is freely available at http://insilico.ehu.es/edu. 
Keywords: PCR, PCR-RFLP, Exercise, Primer, Restriction endonuclease 



Findings 

In 1993, Kary B. Mullis was awarded the Nobel Prize in 
Chemistry "for his invention of the polymerase chain reac- 
tion (PCR) method", a technique he started to work on 
only a decade before, in 1983 (http://www.nobelprize.org), 
PCR revolutionized molecular biology, and was promptly 
combined with Restriction Fragment Length Polymorph- 
ism technique to yield PCR-RFLP [1]. Both, PCR and 
PCR-RFLP are widely used molecular biology techniques. 

The purpose of this work was to develop an online learn- 
ing resource for PCR and PCR-RFLP techniques. Some 
programs for PCR simulation are available online [2,3], but 
they have been created primarily for research. At any rate, 
their features do not include PCR-RFLP simulation. Our 
group also created a PCR simulation program available on- 
line since 2003 [4], which includes in its actual version (but 
not in the original one) a PCR-RFLP simulation program 
and programs to simulate other molecular biology tech- 
niques against completely sequenced prokaryotes. The ser- 
vice described in this work facilitates "learning by doing" 
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through simulation of all the steps required to complete 
PCR and PCR-RFLP experiments, and may be considered 
introductory to the molecular biology simulation programs 
available on our website. To our knowledge, no integrated 
service similar to the one described in this work is available 
online or as a downloadable program. 

Usage of the online exercise does not require registra- 
tion, and several users may access the site simultan- 
eously (for example, a class of students in a computer 
room). All the exercises may be completed without leav- 
ing the server, and the exercise is available in English, 
Spanish, and Basque. A video and a presentation explain 
how to complete the exercises and outline the basics of 
the simulated techniques. 

The service will generate for each user/session a PCR 
(short exercise) or PCR-RFLP (long exercise) problem: 
one out of 750 pre-selected genes present in two strains 
of the model organism E. coli is assigned to the user. To 
solve the PCR exercise, users must retrieve the selected 
gene sequences, align and compare their DNA and pro- 
tein sequences, design primers, select the reagents for 
the PCR reaction, and simulate it. The PCR exercise is 
solved when bands are observed for both genomes in 
the virtual agarose gel. If the long exercise is chosen, se- 
quences of the two amplicons obtained in the PCR 
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l.PCR results 


3. Visualization of bands in agarose gel 
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Figure 1 Basic output of tiie exercise. During the exercise, PCR amplification is carried out for botli problem genomes with primers selected 
by the users (GGGCGTGATCCA^mATG and CTAmGCGCG^mGACA) for problem gene (ymdA) (1). In the PCR-RFLP exercise, after selection of 
the restriction endonuclease, different cleavage patterns are expected for amplicons (2), and the bands yielded will be shown in a simulated 
gel (3). 
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Figure 2 Clustering of PCR amplicons. Sequences from all amplicons are compared and clustered. To allow a fast response to several 
simultaneous users (as for example to students in a computers room) the amplicons with identical sequences are grouped and identified with a 
letter. In this example, amplicons obtained with primers GGGCGTGATCCA I I I I I ATG and CTAmGCGCG I I I I I GACA from all 57 sequenced E. coli 
genomes are grouped into 16 groups with identical sequences (A to P). 
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experiment must be searched for restriction endonucle- 
ases that show different cleavage patterns. The selected re- 
striction endonuclease is used for simulation of the PGR- 
RFLP exercise. The exercise is solved when different bands 
patterns are obtained for the two genomes (Figure 1). At 
the end of the exercise, all the information generated dur- 
ing the experiment can be retrieved, including the se- 
quences of the amplicons. 

Additionally, the program allows the simulation of PGR 
and PGR-RFLP techniques for all complete E. coli genomes 
available through the International Nucleotide Sequence 
Database (http://www.insdc.org/), as well as the compari- 
son of the results. For PGR, the amplicons obtained for E, 
coli genomes may be aligned, compared, and clustered, and 
a dendrogram is generated (Figure 2). When two or more 
simulations are performed (for example, by several stu- 
dents) the results obtained in different experiments may be 
compared. As PGR-RFLP results and dendrograms ob- 
tained for each gene/experiment differ considerably, diver- 
gent results may be used to emphasize the importance of 
proper gene selection to allow clustering patterns obtained 
in the dendrograms to be associated with phylogenetic re- 
lationships among bacterial strains, as might be necessary, 
for example, during an outbreak. 

The exercise s educational benefit stems from the fact 
that one needs to understand the crucial steps and pro- 
cedures carried out during the exercise in order to 
complete it, and relies upon the availability of sequences 
on the internet, as well as programs for alignment, pri- 
mer design, and comparison of endonuclease restriction 
patterns or clustering. Multiple-choice questions were 
generated as a companion to the service. 

Implementation 

The website was developed with open-source software 
(PHP running on an Apache server using the Linux oper- 
ating system). Sequences from prokaryotic genomes and 
other related files were obtained from NGBI [5], and data 
related to restriction endonucleases from REBASE [6]. 

The open-source programs GlustalW [7] and PrimerS 
[8] were integrated in the service for alignment and 
primer design respectively. For comparison of restriction 
endonuclease patterns, and computing of distances and 
clustering of amplicons, specialized scripts were devel- 
oped. To retrieve sequences and other data from ampli- 
cons, the service was integrated with the actualized 
version of the web service developed by Bikandi et al. 
[4]. To allow fast responses to several simultaneous users, 
most data used by the service were pre-computed. 

Availability and requirements 

The PGR exercise is available over the Internet at http:// 
insilico.ehu.es/edu. Its correct behavior was verified with 
all major browsers. 



The script to search for commercially available restric- 
tion endonucleases capable of generating different re- 
striction patterns for the amplicons obtained in the 
experiment was integrated with a previous script devel- 
oped for the restriction of single sequences [4] and a 
multipurpose independent downloadable PHP program 
was generated (http://insilico.ehu.es/restriction/main/). 

Conclusions 

An online exercise for the design and simulation of PGR 
and PGR-RFLP experiments was created. At the end of 
the exercise, the designed procedures may be applied to 
all sequenced E, coli genomes. The exercise may be com- 
pleted individually or simultaneously by students present 
in a computer room. The latter option also allows the com- 
parison of results obtained in different experiments, so the 
importance of gene selection prior to inference of phylo- 
genetic relationships among strains may be demonstrated. 
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